LOOP-ERASED RANDOM WALK ON FINITE GRAPHS AND THE 

RAYLEIGH PROCESS 



JASON SCHWEINSBERG 

Abstract. Let (Gn)5?Li be a sequence of finite graplis, and let Yj be tiie length of a loop-erased 
random walk on Gn after t steps. We show that for a large family of sequences of finite graphs, 
which includes the case in which G„ is the d-dimensional torus of size-length n for d > 4, the 
process {Yt)tLo, suitably normalized, converges to the Rayleigh process introduced by Evans, 
Pitman, and Winter. Our proof relies heavily on ideas of Peres and Revelle, who used loop- 
erased random walks to show that the uniform spanning tree on large finite graphs converges to 
the Brownian continuum random tree of Aldous. 



1. Introduction 

The loop-erased random walk is a process obtained from a random walk by erasing loops in 
chronological order. More precisely, given a sequence of points A = (uq, ui, . . . , itj), which we 
can think of as the first j -|- 1 points visited by some random walk, we define the loop-erasure 
LE{X) to be the sequence (vq, . . . , Vk) obtained inductively as follows. First set vq = uq. Suppose 
vq, . . . ,Vm have been defined for some m > 0. If f m = Uj, then k = m and Vm is the last vertex 
in the sequence LE{X). Otherwise, define Vm+i = Ur+i, where r = max{i : Ui = Vm}- We denote 
the number of points in LE(\), which in this example is fc -|- 1, by \LE{X)\. We call \LE{X)\ the 
length of the loop-erased path. 

The loop-erased random walk was first studied in 1980 by Lawler [7], and the model has con- 
tinued to receive attention in recent years, in part because of connections with uniform spanning 
trees that were discovered by Pemantle [12j and Wilson p!6]. For the loop-erased random walk 
on Z"' with d > 5, Lawler [7] showed that a positive fraction of the vertices never get erased, so 
if the random walk is run for time n, then the length of the loop-erased path is also of order n, 
and the loop-erased random walk as a process converges to Brownian motion. The loop-erased 
random walk on also converges to Brownian motion, as shown by Lawler in [8|, but there is a 
logarithmic correction to the length of the path. If the random walk is run for time n, the length 
of the loop-erased walk was shown by Lawler [lOj to be of the order n/(log n)^/^. The loop-erased 
random walk behaves much differently in dimensions two and three, but there has been recent 
progress in these lower dimensions. Kenyon [5j showed that the length of the loop-erased random 
walk on is of order n^/*, while Lawler, Schramm, and Werner [llj showed that the loop-erased 
random walk on Z^ converges to the Schramm-Loewner Evolution (SLE) with parameter k = 2. 
Kozma [6] established the existence of a scaling limit for the loop-erased random walk on Z^, but 
the form of the limiting process remains unknown. In this paper, we consider the behavior of the 
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loop-erased random walk on large finite graphs. We will focus especially on the d-dimensional 
torus of side length n, which we denote by = {0, 1, . . . , n - l}'^, for d > 4. 

Given a finite connected graph G„ = (V^, En), write v E Gn if u is a vertex of Gn and let \Gn\ 
denote the number of vertices of G„. Write v ^ w ii the vertices v and w are connected by an 
edge. Throughout the paper, we will assume that Gn is vertex transitive. Therefore, every vertex 
has the same degree, which we denote here by d. Let {Xt)^Q be a discrete-time Markov chain 
taking its values in Vn such that 

{1/2 ifv = w 
l/2d if vr^w 
otherwise. 

That is, at each step the Markov chain stays at its current vertex with probability 1/2 and 
otherwise moves to a randomly chosen neighboring vertex. This process is often called the lazy 
random walk, as opposed to the simple random walk which never stays at its current vertex. 
Fix a vertex o £ Gn to be the starting point for the random walk, and denote the transition 
probabilities of the random walk by pt^ni^) = P{Xt = x\Xq = 6). Since G„ is vertex transitive, 
the stationary distribution of the random walk is given by n{x) = l/\Gn\ for all x G Gn- Because 
the lazy random walk is aperiodic, the distribution of Xt converges to the stationary distribution 
as t — > oo. Denote the uniform mixing time of the random walk, that is, the mixing time measured 
in terms of the separation distance, by 

1' 



(1) Tn = min <j t : sup 



< 

- 2 



7r(x) 

It is well-known that on Z^, there exist positive constants Ci and C2 such that for all n, 
(2) Gm^ <Tn< G2n^ 

(see, for example, the calculations in chapter 5 of [2j). For the rest of the paper, we will work 
with a sequence of vertex-transitive, finite graphs {Gn)^=i such that 



lim \Gn\ = 00. 

n— >oo 



We will consider two cases. 



Case 1: The graphs {Gn)'^=i satisfy the following conditions: 
• There is a constant C such that 

LIG„|i/2j 

sup sup ^ (t + l)ptAx) < G. 



n xeGn 



There is a 5 > such that 
(3) lim ^^^7- — ^ = 0. 



n^oo 

4 



Case 2: For all n, we have G„ = Z„. 

The conditions for case 1 are precisely the conditions assumed in \Xd\ by Peres and Revelle, 
who used loop-erased random walks to show that the scaling limit of the uniform spanning tree 
on these graphs as n — > 00 is the Brownian continuum random tree of Aldous As pointed 
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out in [13], this family of graphs includes the d-dimensional torus for d > 5, the complete 
graph on n vertices, the hypercubes Z2 , and expander graphs. Schweinsberg [H] showed that the 
scaling limit of the uniform spanning tree on is also the continuum random tree. The results 
in this paper will hold for the four-dimensional torus as well as for the family of graphs studied 
by Peres and Revelle, but at times in the proofs the two cases will be treated separately. 

Our goal is to study how the length of the loop-erased random walk on G„ evolves over time. 
On WJ^ for d > 5, it is known that the length of the loop-erased random walk grows linearly 
in time; see Theorem 7.7.2 of [9j, and see also Theorem 7.7.5 of |9j for a similar result when 
d = 4. The reason is that the random walk does not make long loops, so loop erasure is a local 
procedure. The linear growth then comes from the fact that most points have approximately the 
same probability of never being erased. For the tori with d > 4 and for the other graphs 
satisfying the conditions of case 1, the random walk also makes long loops. The long loops occur 
on a time scale much longer than the mixing time. As a result, when the random walk makes a 
long loop, the point on the path that the random walk hits is approximately uniformly distributed 
over all the points in the path. Therefore, when the long loop is erased, the length of the path 
is multipied by a fraction which is approximately uniformly distributed between and 1. As on 
Z'^, the length of the path grows approximately linearly between the times when these long loops 
form. 

As a result of this intuition, Jim Pitman conjectured that the length of the loop-erased random 
walk on G„ converges to a process called the Rayleigh process as n ^ 00. The Rayleigh process 
{R{t),t > 0) was introduced by Evans, Pitman, and Winter [4]. The process grows linearly at 
unit speed between jumps. At time t, jumps occur at rate R{t—), and at the times of jumps, 
the value of the process gets multiplied by a random variable which is uniformly distributed on 
[0, 1]. More formally, the Rayleigh process can be constructed from a Poisson point process 11 
on [0, 00) X [0, 00) whose intensity measure is Lebesgue measure. Given y > 0, we can obtain a 
Rayleigh process started from R{0) = y hy defining 

(4) R{t) = {y + t)A inf{x + {t - s) : {s,x) e U,0 < s < t}. 

This means that when (t, x) is a point of this Poisson process and R{t—) > x, there is a jump at 
time t and R{t) = x (see Figure 1 below). It was shown in Proposition 8.1 of [4J that the stationary 
distribution for this process is the Rayleigh distribution, where we say a random variable W has 
a Rayleigh distribution if P{W > x) = for all x > 0, and that for any starting point y, 

the distribution of R{t) converges to the Rayleigh distribution as t — > cx). For more about the 
Rayleigh process, see section 8 of [1]. 

It is already known (see [l3] for case 1 and [I5] for case 2) that if x and y are vertices of G„ 
chosen uniformly at random, then the distribution of the length of the loop-erased random walk 
started at x and run until it hits y, suitably normalized, converges to the Rayleigh distribution 
as n — > 00. The theorem below, which was conjectured by Pitman, is a dynamical result, which 
shows that the length of the loop-erased random walk converges to the Rayleigh process, in the 
sense of Skorohod convergence for processes whose sample paths are right continuous and have 
left limits. When Gn is the complete graph with n vertices, this result can be obtained from 
Corollary 8.2 of [4]. 

Theorem 1.1. Let R = {R{t),t > 0) denote the Rayleigh process with R{0) = 0. Let {Gn)'^=i 
be a sequence of graphs satisfying the conditions of either case 1 or case 2. Let {Xt)'^Q he a 
lazy random walk on Gn, as defined above. For all t, let Yt be the length of the loop-erased 
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X 




Figure 1: The Rayleigh process 



path LE{{Xs)1^q). Then there exist sequences of constants {an)'^=i and (6n)5^i satisfying < 
inf a„ < supa„ < oo and < inf 6„ < sup6„ < oo such that if we define the continuous-time 
process Zn = {Zn{t), t > 0) by 



then Zn ^ R in the Skorohod topology as n ^ oo. 

Remark 1.2. We work with the lazy random walk rather than the simple random walk because 
our proof uses results about the mixing time which require the random walk to be aperiodic. 
However, once convergence to the Rayleigh process is established for the lazy random walk, it 
follows easily for the simple random walk with a„ replaced by a„/2. 

Remark 1.3. By the results in section 8 of [13], when G„ = for some d > 5, there are positive 
numbers a and b such that lim^^oo a„ = a and lim„_^oo &n = b. It follows from the definition of 
the Skorohod metric that we still have Z„ — > in the Skorohod topology if we take an = a and 
bn = b for all n. 

Remark 1.4. To understand the scaling in case 2, note that the appropriate time scale for 
convergence to the Rayleigh process is the time scale on which long loops occur, as these long 
loops correspond to the jumps of the Rayleigh process. Given a random walk segment of length L, 
if one removes a segment of length r„ from the middle, then the probability that the two remaining 
segments intersect (which would correspond to a long loop) is the same order of magnitude as 
the probability that two independent random walk segments of length L on started from the 
uniform distribution intersect. The expected number of intersection times between two such walks 
is LP' /n^. Furthermore, if there is one intersection, then there will likely be O(logn) intersections 
because two independent random walks of length n in started at the origin will intersect 
O(logn) times (see Proposition 3.2.3 of [9j). Consequently, the intersection probability of two 
random walks of length L on started from the uniform distribution is of order L^/(n^logn) 
(see Propositions 2.8 and 2.9 of [l5]), which is of order one when L is of order n^(logn)^/^. This 
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explains the time scaling in case 2. The spatial scaling results from the fact that the number of 
points remaining after loop-erasure is of order n^(log n)^/^/(log n)-*^/^ = n^(logn)^/^. 

There are two steps to proving Theorem 11.11 First, we must show the convergence of finite- 
dimensional distributions. That is, we need to show that if < ti < t2 < ■ ■ ■ < tk, then 
(Zniti), . . . , Zn{tf:)) converges weakly to . . . , This is done in section [2j Then, we 

must show that the sequence of processes {Zn)'^=i is relatively compact, which we do in section 
[3l These results imply that Z„ ^ in the Skorohod topology (see Theorem 7.8 in chapter 3 of 

m)- 

2. Convergence of finite-dimensional distributions 
Fix A: G N, and fix times < ti < t2 < • • • < ife- Our goal in this section is to prove that 

(5) . . . , Zn{tk)) {R{h), R{tk)). 

The proof proceeds in three steps. First, we review some results concerning the loop-erased 
random walk on Gn- Next we set up a coupling between the loop-erased random walk and the 
Rayleigh process. The result ([5]) will then follow from some bounds for the Rayleigh process. 

2.1. Loop-erased random walk on G„. To study the loop-erased random walk on G„, we 
follow the strategy introduced by Peres and Revelle in [13] of splitting the random walk into 
shorter segments. The ideas are the same in cases 1 and 2, but it is necessary treat the two cases 
separately. Some of the minor differences in the treatments of the two cases could be avoided, 
but we prefer to set up the notation so that we can directly invoke results in [13] and [15| . 

First, consider case 1. Following [T3], let r = Lr^^lGnp/sj and s = [Tn^^\Gn\^/^ \. Note that 
([3]) implies that s is much smaller than r when n is large. We will work with segments of the 
random walk whose length is approximately r. For all positive integers i, let Ai = {{i — l)r -|- 
2s -|- 1, (i — l)r + 2s + 2, . . . ,ir — s}, which will be the set of times associated with the ith segment. 
Say that a time u is locally retained if LE{{Xt)^^^^^^Q D = 0. Let U denote the 

set of times u € Ai that are locally retained. Still following |13j . define the local loop erasure of 
the segment Ai by LEs{Ai) = {Xt)teAinu ■ Denote by \LEs{Ai)\ the cardinality of At n U, which 
is the length of the path LEs{Ai). For any V C G„, define the capacity of the set V by letting 
(Wt)'^Q be a random walk on G„ started from the stationary distribution vr and then defining 

(6) Cap(y) = p((m)Lonv^/0). 

As in [13J, define the constants 7^ = r~^E[\LEs{Ai)\] and q„ = r'''^\Gn\E[Cap{LEs{Ai))]. As 
can be seen from Lemma 5.3 of [l3], the sequences of constants (a^)^! and (7n)5^i are bounded 
away from zero and infinity. 

For case 2, let r = [n^(log n)^/^^J and let Ai = {{i — l)r, (i — l)r + 1, . . . ,ir — 1} for all positive 
integers i, as in [15j. By Corollary 3.2 of [15], we can define a sequence of constants (7n)J^i, 
bounded away from zero and infinity, such that for some positive constants G and C, we have 

\LE{{Xt)teAj\-lnn\lognf/'' ^ ' ^ 



(7) P 



> 



(logn)i5/44j - (logn)2/ii 



for all n. The exponent of 5/66 = 9/22 — 1/3 comes from the fact that in four dimensions, the 
length of the loop erasure of a random walk of length r is of the order r/(logr)^/^. Also, fix a 
small positive number t/ > 0, and let w = [n2(log n)''J . For V C Z^, define Cap(F) as in ([6|) but 
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with r — 2w in place of r. By Proposition 3.8 of [15j, there is a sequence of constants {an)'^=i, 
bounded away from zero and infinity, such that for some positive constant C, we have 



P 



Cap{LE{{Xt)teA,)) 



Or 



(logn)2/ii 



> 



< 



C 



(log n)^/22y (logn)^/^^"'' 



These bounds show that the length and capacity of the loop-erased segment LE{{Xf)t(zAi) are 
highly concentrated around their means. 

In the absence of long loops, the ith segment of the random walk of length r, after loop erasure, 
looks approximately like the path LEs{Ai) in case 1 and like the path LE((Xt)t£Ai) in case 2. 
However, long loops can cause entire segments of length r to get erased. We will use indicator 
random variables to keep track of the long loops. In case 1, for i < j, let lij be the indicator of 
the event that LEs{Ai) n {Xt)t£Aj / O- In case 2, let = for all i and, for i < j — 1, let 

Jjj- be the indicator of the event that LE{{Xt)t£Ai) H {Xt)teAj 7^ O- Thus, in both cases, when 
Jjj- = 1, the jth segment of the random walk of length r intersects the loop-erasure of the ith 
segment. For both cases, let Sq = {0} and, for j > 1, let 

Sj = {ke Sj-i : lij = for all i e {I, . . . , k} n Sj-i} U {j}. 

Here Sj consists of the indices of the segments that are not erased in the loop erasure through 
time jr, with the convention that if segment j intersects the loop-erasure of segment i, causing 
segments i + 1, . . . , j — 1 and parts of segments i and j to be erased, we keep j in Sj but not i. 

The number of segments not erased through time jr is \Sj\, where | Sj \ denotes the cardinality 
of Sj. Since \LEs{Ai)\ has mean 7„r in case 1 and \LE{{Xt)t£Ai)\ has mean approximately 
7^71^ (log n)^/^^ in case 2, the length Yjr of the loop-erased random walk at time jr can be 
approximated by 7„,r|5j| in case 1 and by 7,in^(logn)^/^^|5j| in case 2. More precisely, we have 
the following result. 

Lemma 2.1. Let (T„)^| be a sequence of fixed times such that for some constants Ci and C2, 
we have Tn < CilGnl^^"^ for all n in case 1 or Tn < C2n2(log n)^/^ for all n in case 2. Let 



a 



1/2^,-1 



7„ . Then, there are positive constants C and C' such that for all n, 



P 



P 



>T„-7n^'(logn)5/66|5pr^/, 



&,^^n2(logn)i/6 



>C"(logn)-i/22 



< 



C(log logn)' 
(log n)i/22 



in case 1, 



in case 2. 



Proof. Equation (41) in section 6 of [13j gives the estimate for case 1. Although (41) in [13J is 
stated for certain random times rather than fixed times, the arguments leading to this result 
(which show, for example, that \LEs{Ai)\ is highly concentrated around its mean, and that the 
contributions of the gaps of length 3s between the segments Ai can be neglected) hold for fixed 
T as well. 

The result for case 2 follows from d?]), and from Proposition 3.10, Proposition 4.1, and Lemma 
4.11 of [15]. Proposition 3.10 of [15] shows that on the complement of an event whose probability 
is shown in Proposition 4.1 of [T5] to be at most C(log n)~^/^^, the only discrepancies between 
\LE{{Xt)^^Q)\ = Itjv a-nd 7„n^(log n)^/^^|S'|-j'^/r] I come from the following three sources: 
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• There are differences, for i G S^x„/r~\^ between \LE{{Xt)t£Ai)\ and 7^72^ (log n)^/®^. By 
([7D, on the complement of an event of probability at most CfT^/r] (log n)~^/^^, which is 
of order (logn)"^/^^, the sum of the absolute values of these differences can be bounded 
above by C \Tn/r]n? {logn)~^^/^^ , which is of order n^(logn)~^^/^^+-^/^^ = n^(logn)~^/'*. 

• There are differences, for i £ S^Tn/r]^ between \LE{{Xt)t£Ai)\ and the number of points 
from the ith segment of the random walk that end up in the path LE{{Xt)fZo)- ^y 
Proposition 3.10 of [E], these differences add up to at most 2\S^Tn/r] \w, which is of order 
n2(logn)i/ii+''. 

• There are contributions from segments that get only partially erased because of intersec- 
tions. By Lemma 4.11 of [15j, on the complement of an event whose probability is of or- 
der (loglogn)^(logn)^^/^^, there are at most (logn)^/^^ such contributions, each of order 
n^(logn)^/^^, so the total contribution is of order n^(log n)^/^^+^/^^ = n^(logn)-'^/^^-'^/^^. 

These observations give the result for case 2. □ 
Let an = an^^"^, and let bn = al/'^j~^ as in Lemma [2?T] Let 

{ral/'^\Gn\~^^'^ in case 1 
ay^(logn)~^/^-'^ in case 2. 



Here dn is an estimate of the length of an individual loop-erased segment after the rescaling in 
Theorem 11.11 Recall that our goal is to find the limit in distribution of {Zn{ti), . . . , Zn{tk))- 
The next corollary, which follows immediately from Lemma 12.11 relates Zn{ti) to the sets Sj 
by showing that Zn{ti) can be approximated by the product of \St*\ (the number of segments 
retained) and dn (the length of a segment). 

Corollary 2.2. For i = 1,...,A:, let t* = \Tn/r'\, where Tn = [an\Gn\^^'^ti\ in case 1 and 
Tn = [a,in^ (log n)^/^fjj in case 2. There exist positive constants C and C , depending on k, such 
that for all i = 1, . . . , k and all n, we have 

P{\Zn{ti) - dn\St'^\ \ > \Gn\"^'^^) < C|G„|-3^/i6 in casc 1, 

P{\Zn{t.) - dn\S,,\\ > C'(logn)-V22) < ^^Qgl^g^)' ,ase 2. 

II (logn)^/^^ 

In particular, in both cases, 

(8) (Z„(ti) — dn\Sq\, . . . ,Zn{tk) — dn\St*J) -^p 0, 

where — >p denotes convergence in probability, using the Euclidean metric on . 

2.2. Coupling with the Rayleigh process. We now show how the loop-erased random walk 
on On can be coupled with the Rayleigh process. It will suffice to couple the sets Sj. We begin 
by reviewing the coupling between the loop-erased random walk on G„ and loop-erased random 
walk on the complete graph used in [13] and [E]. Let m = [r~^a~^|G„|] = [d^^] in case 1, 
and let m = [a~"'^(logn)^/^"'^J = [d~'^\ in case 2. Let be an i.i.d. sequence of random 

variables which have the uniform distribution on {1, . . . ,m}. The process can be viewed 

as a lazy version of a random walk on the complete graph with m vertices in which on each step 



8 



JASON SCHWEINSBERG 



the random walk stays in its current position with probability 1/m. For i < j, let lij = 
Let 5*0 = {0} and, for j > 1, let 

Sj = {ke Sj-i : iij = for ah i G {1, . . . , A;} n Sj-i} U {j}. 

Here Sj consists of the vertices of the walk that are not erased after j steps. 

The lemma below shows that the sets Sj can be coupled with the sets Sj with high probability. 
This coupling is possible because the jth segment {Xt)t£Aj is approximately equally likely to 
intersect any of the previous segments. This is true because the mixing time r„ is much shorter 
than the length of the segments (a consequence of ([3]) in case 1 and ([2]) in case 2), so conditional 
on the previous segments, the jth segment is at approximately a uniform random point a short 
distance into the segments. The proof of Lemma 6.3 of [13] gives the bound for case 1, while the 
bound for case 2 comes from Proposition 4.7 of [15]. The results in [13] and [15] are stated for 
the case in which the random walk is run for a random time rather than the fixed time f^, but 
the same proofs work for fixed times. 

Lemma 2.3. Define as in Corollary \ 2. 21 There exists a coupling of {Sj)Jl^i and {Sj)JLi such 
that for some constant C , we have 



f 1 - C\G 

P{Sj = Sj for a// j = 1, . . . , 4) > I ^ _ ^^j^^ „)-i/22+^ 



1 -C|G„|-'^/i6 in easel 

in case 2. 



Our next step is to couple the Sj with a Rayleigh process. Let 11 be a Poisson point process 
with Lebesgue intensity on [0, oo) x [0, oo). For all t > 0, define Rt by @ with y = 0. For 
< i < j, let /• J be the indicator of the event that there is at least one point of 11 in 

(9) [dnU - l),dnj) X [dn{i - l),dni) 

Let S'q = {0}. For j > 1, conditional on S'-^ = {ii, . . . ,£ig{ i}, where ii < • • • < £15/ 1, define 

S'j = {4 G S'j_^ : I'ij = for all i < k} U {j}. 

Because the rectangles in ([9]) have area and are disjoint, the indicator random variables 
are independent and equal 1 with probability 1 — e~'^^. Also, conditional on Sj-i = 
{£1, . . . , ^|}, the probability that Ijj = for all i G Sj-\ is 1 — |5j_i|/m and, for all h < \Sj-i\, 
the probability that li^^j = 1 and lij = for all i £ Sj-i with i ^ h is 1/m. Consequently, 
we will be able to couple the 5^ and Sj by using the following elementary lemma. Because this 
result is a special case of Lemma 4.5 in |15| . we omit the proof. 

Lemma 2.4. Suppose < p < 1/ j and < q < 1. Suppose Vi, . . . ,Vj are random variables such 
that P{Vi = for all i) = 1 —jp and P{Vi = 1 and = for £ ^ i) = p for i = 1, . . . , j . Suppose 
Wi, . . . , Wj are independent random variables such that P{Wi = 1) = q and P{Wi = 0) = 1 — q. 
Then there is a coupling of Vi, . . . ,Vj and Wi , ■ ■ ■ , Wj such that 

P{Vi = Wi for all i)>l- j\p -q\- j{j - l)q^ 
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Lemma 2.5. Define as in Corollary \2. 21 There exists a coupling of {Sj)'jLi and {Sj)'jLi such 
that, for some constant C , we have 



P{S'^ = Sjfor allj = l,...Xk)> 



C(logn) 



-1/11 



in case 1 
in case 2. 



Proof. Let p = 1/m and q = 1 — e~°'". Note that |»S'j_;^| < j and jSj-il < j for all j. By 
Lemma \ZM conditional on Sj_i = Sj-i, a coupling can be achieved such that the probability 
that Sj = Sj is at least 1 — j\p — q\ — j{j — 1)^^- Therefore, there is a coupling such that 

P{S'j = Sj for all i = 1, . . . , 4) > 1 - {j\p -q\- j{j - l)q^) 

i=i 

(10) >l-{tlf\p-q\-{tlfq\ 

For all a; > 0, we have < e"^ - 1 + x < x"^/! and < 1/x - < l/x - l/{x + 1) < l/x^. 

Also, if X > 2, then < l/[x\ - l/x < l/{x - 1) - l/x < 2/x^. Therefore, we have in case 1 

3dt 



\p-q\ 



1 



\dn 



(1 



< e 



-dl 



1 + C I + 



di 



1 



-2i 



and in case 2 for n large enough that > 2, 



\p-q\ 



1 



\dr, 



(1 



< e 



-dl 



1 + CI + 



\dn 



d'^ 



d"^ 



5< 
2 



Also, = (1 - e 



-dl^2 



< . By combining these bounds with ([TO]) , we can bound the probability 



that S'- 7^ Sj for some j < tl in both cases by 



2(4)X + (tDX. 

Because there is a constant C such that < Cr^^|G„|^''^ in case 1 and < C(logny^^^ in case 



2, the result follows. 



□ 



Remark 2.6. As another way of understanding the scaling, let Lj be the cardinality of Sj, 
which is the length of the loop-erased random walk on the complete graph Km after j steps. 
Conditional on Lj, with probability 1 — Lj/m the next step of the walk will not form a loop and 
we will have Lj^i = Lj + 1, and for k = 1,. . . ,Lj, with probability 1/m the next step of the 
walk will duplicate the A;th vertex on the current path, and after the loop erasure we will have 
Lj = k. Thus, 



E[Lj+i\Lj] 



m 



k=l 



2m 



2m ' 



and so E[L 



'i+iJ 



1 + E[Lj] - E[Lj]/2m - E[Lf\/2m. Therefore, letting x = \\mj^ooE[Lj], we 

have X ~ 1 + x — x/2m — x^ /2m, and since xP' is much larger than x, it follows that x ~ \/2m, 
where the approximations only give the correct order of magnitude because x^ is being used to 
approximate ElLj]. Thus, in the long run, the loop-erased paths should contain on the order of 
m segments, which in case 1 is the order of d^^ and in case 2 is of order (logn)^/^^. Loop-erased 
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segments have length of order r in case 1 and of order n^(logn)^/^^ in case 2, so the length of 
the loop-erased path should be of order r/dn, which is of order |G„|^/^, in case 1 and of order 
n^(log n)^/^^(log n)^/^"*^ = (logn)^/^ in case 2, consistent with the scaling in Theorem II .![ 

2.3. Bounds for the Rayleigh process. Corollary 12.21 bounds the process Z„ using the sets 
Sj, and Lemmas 12.31 and 12.51 couple the sets Sj and the sets S'y In this subsection, we complete 
the proof of ([5]) by using the sets S'j to obtain bounds for the Rayleigh process R. Recall that 
both the Rayleigh process and the sets Sj are constructed from the same Poisson process H. We 
begin by obtaining a bound which is valid at times that are integer multiples of d„. This is a 
deterministic bound which follows from the construction of R. 

Lemma 2.7. For all nonnegative integers j, we have \R{dnj) — dn\Sj\ \ < dn- 

Proof. We proceed by induction on j. The result is trivial for j = 0. Assume, for some integer 
J > 1, the result holds for j — 1- We consider two cases. 

First, suppose I- j = for alH < |S'^_;^|. Then \Sj\ = \Sj_i\ + 1. By the definition of the I- j, 
there is no point of 11 in [dn{j — l),dnj) x [0,dn\Sj_-^\). It follows that 

mm {dn\S'j_^\,R{dn{j - l))+dn} < R{dnj) < R{dn{j " 1)) + C^n- 

Now the induction hypothesis gives dndS^I — 1) < R{dnj) < dnd/Sjl — 1) + 2dn, which implies 
the claim. 

Alternatively, suppose there is an ^ < |5'j_i| such that I'^j = 1 and I'^ j = for i < i. Then 
\Sj\ = i. There is no point of 11 in [dn{j — 1), dnj) x [0, dn{i — 1)) but there is a point of 11 in 
[dn{j — ^),dnj) X [dnia. — l),dni)- Therefore, one can see from the construction that 

(inl^- 1) < R{dnj) < dni + dn, 

and again the claim follows. □ 
Lemma 2.8. There is a constant C such that for all n and for i = 1, . . . ,k, we have 

P{\R{t,) - dn\S',,\\ > 3d„) < j ^'f"'"'l' /n i 
^' ' S'l ^ 6(logn) ^/^^ m case 2. 

In particular 

(11) {dn\S[,\,. ■ ■,dn\S[,^\) -^p . . .,R{tk)). 



Proof. From the definitions of t* and d^, we get dn{tl — 2) < tj < dn{t* + 1) in both cases for 
large enough n. For all < s < t, we have R{t) < R{s) + (t — s). Using this fact for the first 
inequality and Lemma 12.71 for the second, we get 

R{U) < R{dn{t* - 2)) + 3dn < dn\S't,_^\ + Adn. 

i 

Likewise, this time using Lemma 12.71 for the first inequality, 

dn\S't,+l\ - idn < R{dn{t* + 1)) - Mn < 

Therefore, on the event that 

(12) \S[,_,\+2 = \S[,\ = \S'r.+,\-l, 
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we have 

\R{ti) - dn\S't*\\ <3dn. 

Thus, it remains only to bound the probabiUty that ()12p fails to occur. However, the event (I12p 
occurs as long as I[ ■ = Q whenever j G {t* — l,t*,t* + 1} and i <t* + 1. Recall that the random 

variables I[ ^ are nonzero with probability 1 — e~'^^ < c?^, so the probability that (I12p fails to hold 
is at most 3(t* + The bounds in the lemma now from the definition of dn and the fact that 

there is a constant C such that < Cr~^ in case 1 and t*^ < C(logn)^/^^ in case 2, and 

the convergence in probability follows easily from these bounds. □ 

Proof of Recall that if (S, d) is a metric space and and v are probability measures on the 
Borel cr-field of (S", d), then the Prohorov distance between ^ and v is defined by 

v) = inf{e > : ^(F) < i^(F^) + e for ah closed subsets F C 5}, 

where F"" = {x ^ S : d{x,y) < e for some y £ S}. It is well-known that if X,Xi,X2, ■ ■ ■ are 
5"- valued random variables with distributions fi, ni, ^2, ■ ■ ■ respectively, then Xn -^d X as n — > 00 
if and only if p{fin, fJ') — > as n — > 00 (see, for example. Theorem 3.1 in chapter 3 of [3]). 

Now consider the metric space M*^ with the Euclidean metric. Let ^„ and denote the dis- 
tributions of {dn \St*\, ■ ■ ■ , dn\St*\) and (d^ {SLl, . . . , dn\SL\) respectively. Lemmas 12.31 and 12.51 
give that f^) — > as n — > 00. Convergence in probability implies convergence in distri- 

bution (see, for example. Corollary 3.3 in chapter 3 of [3]), so if denotes the distribution of 
{R{ti), . . . , R{tk)), then p{i'n, ^ as n ^ 00 by (fTT]l . Combining these results gives 

(13) {dn\Sq\,...,dn\Stl\) -^d {R{h),...,Ritk)) 

By writing Zn{ti) = (ZniU) — dn\St*\) + dn\St* | for i = 1, ... A;, we can combine ([8]) and (fT3l) with 
Slutsky's Theorem (see Theorem 8.6.1 in [14J) to obtain ([5|). □ 



3. Relative compactness 

To complete the proof of Theorem 11.11 it remains to show that the sequence of processes 
{Zn)^=i is relatively compact. The sequence of processes {Zn)^=i is relatively compact if the 
following two conditions hold (see Corollary 7.4 in chapter 3 of [3j): 

• For all e > and t >0, there is a compact set K such that 

liminf P(Z„(t) £ K)>l-e. 

n— >oo 

• For all e > and T > 0, there is a ^ > such that 

limsup P(t(;(Z„, 0, T) > e) < e, 

n— >oo 

where 

(14) w(Z„, ^, T) = inf max sup |Z„(t) — Z„(n)| 

and {ti} ranges over all sequences (ij)E^o with m > 1 such that = to < ■ ■ ■ < *m-i < 
T <tm and mini(tj — tj-i) > 9. 
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The first condition follows immediately from the convergence in distribution of Zn{t) to R{t). 
Therefore, our goal in this section is to establish the second condition. 
For t > 0, define 

[a„|G„|"^/^tJ in case 1 

[a„n^ (log n)^/^tj in case 2. 



9it) 

Also, let 

L 

and 



O'TilGnl^^'^ in case 1 

" \ anV? (log ny/^ in case 2 



in case 1 
(logn)V*^ m case Z. 

We now choose the points (ti)^Q. Suppose T, 9, and e are fixed. Let = 0. For positive integers 
i, let 

(15) t'^ = mf{t > + e : = Xg(^t^_, G LE{{XXti'^) for some v > M„e}. 

This means that at the time g{t'j), the random walk (Xf)t>o completes a long loop, causing 
a downward jump in the process {Zn{t),t > 0) at time t[. Since > t'-_i + 6* for all i, we 
can choose the points (ti)^o *o coincide with the t^, and add additional points in the gaps 
between the when the gaps have length greater than 29. More precisely, it is possible to choose 
= to < ■ ■ ■ < tm-i < T < tm such that 6 < ti — ti^i < 26* for z = 1, . . . , m and, if t'j < T, then 
t'j = ti for some i. Note that with this construction, if ti does not equal t'- for some j, then the 
open interval [ti — can not contain any of the t'y 

We will use the sequence (ti)^o *° upper bound the right-hand side of (fill) . lit<u, then we 
need both an upper bound for Zn{u) — Zn{t), which will show that the loop-erased path does not 
grow too quickly, and an upper bound for Zn{t) — Zn{u), which will show that the loop-erased 
path does not decrease in length too quickly. The bounds for Z^ (u) — Zn (t) , provided in Lemmas 
13.11 and [3.31 are relatively straightforward. Obtaining an upper bound for Zn{t) — Zn{u) is more 
difficult because the length of the loop-erased random walk jumps down at the times when the 
random walk makes long loops. However, because the points t[ are chosen to coincide with these 
jumps as long as they are far enough apart, the right-hand side of (jl4p will only be large if the 
process Z^ makes two jumps within a time interval of length 9. This will typically happen only 
if the random walk makes two nested long loops within a short time, as we show in Lemma |3.5[ 
and we will bound the probability of getting two long loops within a short time in Lemma 
Note that Lemmas 13. 1|, 13. 5| and 13.61 imply Property 2 in case 1 , while Lemmas 13. 3[ 13. 5| and 
imply Property 2 in case 2. Theorem 11.11 follows . 

3.1. Bounding the growth of the loop-erased walk. For case 1, the necessary bound on 
the growth of the loop-erased random walk follows easily from the fact that the length of the 
loop-erased walk can grow by at most one on each time step. 

Lemma 3.1. In case 1, there is a constant C such that if e > and 9 < e/C, and if ti-i < t < 
u < ti, then Zn{u) — Zn{t) < e. 

Proof If ti-i < t < u <ti, then n - t < 26* and Ygt^^) - Yg^^t) < g{u) - g{t) < 1 + 2an\Gn\^/'^9 . 
Therefore, 

Zn{u) - Zn{t) = hn\Gn\-^'HYg(^) - Yg^t)) < hn\Gn\'^'^{l + 2an\Gn\''^9) < G9 
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for some constant C, which imphes the lemma. □ 

For case 2, this trivial bound is insufficient, and we must make use of the fact that the length 
of the loop-erasure of a random walk segment of length r is typically of order r/(logr)^/^. Recall 
that Ai = {{i - l)r, . . . ,ir - 1}. Let i = \g{T + 29) /r]. Let ?7 > 0, and let w = [n2(log n)*?] . 
It follows from the proof of Proposition 4.1 of [15] that for sufficiently large n, outside of an 

event of probability at most C{logn)~^^^^ , the random walk {Xt)f^'^'^^^ satisfies the following 
two properties. Recall that t„ is the uniform mixing time of the random walk, defined in ([1]). 

• Every interval [t, t + w] with < t < g(T + 26) — w contains a local cutpoint, that is, a 
point u such that {Xu-2t„,- ■ • n {X^+i, ■ ■ ■ ,Xu+2t„} = 0- 

• For all i < i, if s,t e Ai with s + 2r„ < t, then Xs / Xt. 

To define the time indices retained after loop-erasure, for positive integers u < v, let cTq'^ = 
max{t G [u,v] : Xt = For i > 1, if <t^_i"^ < v, let a^'^ = max{t € [c'"^_l"^,u] : Xt = X^-^,v^j^{\. 

Let W{u,v) be the set of times o""'^, so the path LE{{Xt)t^^) consists of the points {Xt)t£W{u,v)- 
We then have the following result. 

Lemma 3.2. Suppose the two conditions above hold. Then for all t < g{T + 29) and all j < I, 

we have \W{0,t)nAj\ < \LE{{Xt)teA,)\ + w + 2Tn. 

Proof. We use ideas from the proofs of Lemma 3.9 and Proposition 3.10 in [15]. Note that 
I VF(0, t) r\Aj\ =0 for t < {j — l)r, and 1 1— > |VF(0, t) n Aj\ is nonincreasing for t > ji — 1 because 
after time jr — 1, indices in Aj can no longer be added but can be lost due to loop erasure. 
Therefore, we may assume that {j — l)r < t < jr — 1. Let 

z = mm{u > (j - l)r : X^ X^ for all v G [u,t],s G {0,1,... ,(i - l)r - 1} n 1^(0, u - 1)}, 

which is the first time during the jth segment of length r after which there are no more in- 
tersections involving earlier segments. Then W{0,t) n Aj = W{z — l,t) because the indices 
(j — l)r, . . . , z — 2 get erased at time z — 1. By the first property above, if z — 1 < g{T + 26) — w, 
the interval [z — 1, z — 1 + w] has a local cutpoint, which we call u. We also have 

wiU - i)r,t) n{u,...,t} = wiz- i,t) n{u,...,t} 

because between times u and t, there are no loops involving points before time u, as the first 
property prohibits short loops and the second property prohibits loops of length longer than 2Tn. 
Therefore, 

(16) \W{0,t)nAj\ < \W{{j-l)r,t)n{u,...,t}\+w, 

as any discrepancy between W{z — 1, t) and W{{j — l)r, t) must come from the interval [2: — !, 1], 
which has length at most w. It follows that 

\W{{j-l)r,t)n{u,...,t}\ < \W{{j-l)r,t)\ 

< \W{{j-l)r,jr-l)\+2Tn 

(17) =\LE{{Xt)teA,)\ + 2Tn, 

where the second inequality holds because, by the second property above, no point before time 
t — 2Tn can be erased between times t and jr — 1, while the equality is just the definition of w. 
The result follows from ([HD and (HT]). □ 
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Lemma 3.3. In case 2, there is a constant C such that if e > and 6 < e/C, then for sufficiently 
large n, 

P(max sup {Zn{u) — Zn{t)) > e) < e. 

* ti-l<t<U<ti 

Proof. By d?]) and Lemma [3^ and the fact that is O(n^) by ([2|), there are constants Ci and 
C2 such that with probabihty at least 1 - Ci(logn)^/^\ we have \W{0,t)nAj\ < C2n'^{logn)^/^^ 
for all t < g{T + 26) and all j < \g{T + 26) /r] . Suppose < t < u < ti for some i. We have 

Z„(n) - = M-i(y,(„) - y,(i)) < M-^(|t^(o, <7(^x)) n {^(t) + 1, . . .,g{u)}\). 

Now u — t< 26, so 5f(u) — < 1 + 2a„n^(logn)^/^0. It follows that there is a constant C3 such 
that the number of segments Aj of length r = [n^(logn)^/^^J that intersect [g{t) + l,g{u)] is at 
most C3(log n)^/^^0. Therefore, with probability at least 1 — Ci{logn)^^^^^ , we have 

Zn{u) - Zn{t) < • C3(logn)l/"0 • C2n2(logn)5/66 ^ C2C3bn6. 

The result follows by choosing C > C2C3 sup^ 6fi and n large enough 

that Ci (log n)-i/ii < e. □ 

3.2. Bounding the probability of nearby long loops. We now work towards bounding the 
probability that Zn{t) — Zn{u) > e if < t < u < ti for some i < m. The next two lemmas 
show that we typically only have Zn{t) — Zn{u) > e if the random walk makes two long loops 
within a short time. 

Lemma 3.4. If ti^i < t < u < ti and Zn{t) — Zn{u) > e, then = t'j for some j. 

Proof. If Zn{t) — Zn{u) > e, then ^(j) — Yg(u) ^ M^e. This is only possible if a portion of the 
path at time g{t) of length at least M„e is erased by time g{u). This means that there are integers 
vi and V2 satisfying vi < g{t) — Mne and g{t) < V2 < g{u) such that X^^ = S -^-£'((-'^s)sLo^)- 
Let X = min{y : g{y) = V2}. By (fT5]) . either x = t'j or t'j < x < t'j + 6 for some j. 

Proceeding by contradiction, suppose we do not have ti-i = t'- for some j. Then, as previously 
observed, the interval (tj-i — 6,ti) does not contain any of the t'y However, this contradicts the 
facts that t'j < x < t'j + 6 and < x < ti. □ 

Lemma 3.5. Let e > 0. Let A be the event that there exist integers vi,V2,V3,V4 such that 
= Xy^ , Xy,j = , and the following hold: 

(a) V2 > vi + \Mne. 

(b) V-i>V2 + Mne. 

(c) V-i<Vi<V^ + l + 2Ln6. 

(d) Vi < g{T + 26). 

In cases 1 and 2, there is a constant C such that if 6 < e/C, then for sufficiently large n, 
P{{ max sup {Zn{t) - Zn{u)) > e} n A") < e. 

Proof. Suppose ti-i < t < u < ti iov some i < k, and suppose Zn{t) — Zn{u) > e. Suppose also 
that 

(18) max sup {Zn{y) - Zn{x)) < ^. 

* ti^i<x<y<ti ^ 

By Lemmas 13.11 and 13.31 it suffices to show that under these assumptions, the event A occurs. 
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By Lemma Em there is a j such that = t'j. Set vs = g{t'j)- By (fT5]) . there is an integer 

V2 such that (b) holds and X^^ = X^^ £ LE({Xs)^IIq^). This means that the portion of the walk 
between times V2 and V3 is erased when the loop forms at time v^. Since Zn{t) — Zn{u) > e, we 
have Yg(^t) - Ygiu) > M„e. Also, ^ gives Yg(^t) - ^^3 = Mn{Zn{t) - Zn{t'j)) < ^Mne. Therefore, 

Ygiu) = Yv, + (Yg^t) -Y.,) + (Yg^u) - Yg^) < + ^ - Mne < Y,, - 

It follows that between times V2, and g{u)^ a portion of the walk of length at least ^M„e must 
get erased. Since the portion of the walk between times V2 and fs was already erased at time 
v^, this can only happen if there is some time ^4, with f 3 < W4 < g{u), such that X^^ = X^-^ e 
LE{{Xu,Y^~q) and vi < V2 — \Mne. Thus, (a) holds. Also, (c) holds because ^4 — ^3 < g{u) — v^ < 
g{ti) - g{ti-i) < l + 2Ln9. Finally, (d) holds because V4 < g{t) < g{tk) < g{T + 29). We conclude 
that A occurs. □ 

Lemma 3.6. Let e > 0, and define the event A as in Lemma \3.5l In cases 1 and 2, there is a 
constant C such that if 9 < e/C, then P{A) < e for sufficiently large n. 

Proof. Let 5n = [^Mne\. For positive integers k such that (A; — < g{T + 29), let 

Bk = {{k - l)6n, {k - l)5n + 1, . . . , M„ - 1}. 

Also, for such fc, and for all nonnegative integers £ such that {k + l)5n + ^Ln9 < g{T + 29), let 

Dk/ = {{k + l)5n + [iLn9\ , (k + l)6n + [iLn9\ + 1, . . . , {k + l)6n + [{i + 3)L„^J + 1}. 

Note that if A occurs, there must exist integers j, k, and i with k > j + 2 and i > such that 
vi G Bj, V2 £ Bk, V3 G Dk/, V4 G Dk/, X^^ = X^^, and X.,j^ = ^^,3. In particular, 
intersects both {Xt)t(^Bj and {Xt)t(^Bk- 

The number of positive integers k such that [k — < g{T + 29) is at most 1 + g{T + 29) /5n, 
and the number of nonnegative integers i such that lLn9 < g{T+29) is at most 1+ g{T +29) / Ln9 . 
Therefore, there is a constant Ci such that the number of triples (j, k, i) that we must consider 
is at most 

g{T + 29) yf g{T + 2e) \ f L^ V f L^ \ C^Lj 

. 5n )\ L^9 J-'^\M^ejW9j M^e^9- 

In cases 1 and 2, when k > j + 2 and ^ > 0, for sufficiently large n there is a gap between 
Bk and Bj of length greater than t„ and a gap between Bk and Dk/ of length greater than 
Therefore, in case 1, it follows from Lemma 5.2 and equation (8) of [13] that there is a constant 
C2 such that the probability that [Xt)t^Dui intersects both {Xt)t&Bj and {Xt)t&Bk is at most 
C2{Ln9)'^{Mne)'^ /\Gn'\^ ■ Likewise, in case 2, it follows from Proposition 3.3 of |15j that there is 
a constant C3 such that the probability that {Xt)t<^Dk e intersects both (Xt)teBj and {Xt)teBk is 
at most C3(-L„0)2(M„e)2/(n^(log n)2). Putting together these bounds, we see that in case 1, we 
have 

P(A)<rr( \( LlM^nO'e' \_ C^C29Lt ^ 

for some constant C4. Likewise, in case 2, we have 

p(A\<:rr ( ( ^nMl9'^e^ \ _ CiC^9Li 
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for some constant C5. These bounds imply the lemma. □ 
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